Exact and approximate discrete optimization algorithms for finding useful disjunctions of categorical predicates in data analysis

نویسندگان

  • Endre Boros
  • Vladimir Menkov
چکیده

We discuss a discrete optimization problem that arises in data analysis from the binarization of categorical attributes. It can be described as the maximization of a function F (l1(x), l2(x)), where l1(x) and l2(x) are linear functions of binary variables x ∈ {0, 1}n, and F : R2 −→ R. Though this problem is NP-hard, in general, an optimal solution x∗ of it can be found, under some mild monotonicity conditions on F , in pseudo-polynomial time. We also present an approximation algorithm which finds an approximate binary solution x2, for any given 2 > 0, such that F (l1(x∗), l2(x∗)) − F (l1(x), l2(x)) < 2, at the cost of no more than O(n log n + 2C/ √ 2n) operations. Though in general C depends on the problem instance, for the problems arising from binarization of categorical variables it depends only on F , and for all functions considered we have C ≤ 1/√2. Acknowledgements: This research was partially supported by the National Science Foundation, Grant IIS-0118635, by the Office of Naval Research, Grant N00014-92-J-1375, and by the Rutgers Distributed Laboratory for Digital Libraries, a Strategic Opportunity Project of Rutgers, the State University of New Jersey.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PERFORMANCE COMPARISON OF CBO AND ECBO FOR LOCATION FINDING PROBLEMS

The p-median problem is one of the discrete optimization problem in location theory which aims to satisfy total demand with minimum cost. A high-level algorithmic approach can be specialized to solve optimization problem. In recent years, meta-heuristic methods have been applied to support the solution of Combinatorial Optimization Problems (COP). Collision Bodies Optimization algorithm (CBO) a...

متن کامل

Pareto-optimal Solutions for Multi-objective Optimal Control Problems using Hybrid IWO/PSO Algorithm

Heuristic optimization provides a robust and efficient approach for extracting approximate solutions of multi-objective problems because of their capability to evolve a set of non-dominated solutions distributed along the Pareto frontier. The convergence rate and suitable diversity of solutions are of great importance for multi-objective evolutionary algorithms. The focu...

متن کامل

Non-Fourier heat conduction equation in a sphere; comparison of variational method and inverse Laplace transformation with exact solution

Small scale thermal devices, such as micro heater, have led researchers to consider more accurate models of heat in thermal systems. Moreover, biological applications of heat transfer such as simulation of temperature field in laser surgery is another pathway which urges us to re-examine thermal systems with modern ones. Non-Fourier heat transfer overcomes some shortcomings of Fourier heat tran...

متن کامل

A novel technique for a class of singular boundary value problems

In this paper, Lagrange interpolation in Chebyshev-Gauss-Lobatto nodes is used to develop a procedure for finding discrete and continuous approximate solutions of a singular boundary value problem. At first, a continuous time optimization problem related to the original singular boundary value problem is proposed. Then, using the Chebyshev- Gauss-Lobatto nodes, we convert the continuous time op...

متن کامل

Testing Soccer League Competition Algorithm in Comparison with Ten Popular Meta-heuristic Algorithms for Sizing Optimization of Truss Structures

Recently, many meta-heuristic algorithms are proposed for optimization of various problems. Some of them originally are presented for continuous optimization problems and some others are just applicable for discrete ones. In the literature, sizing optimization of truss structures is one of the discrete optimization problems which is solved by many meta-heuristic algorithms. In this paper, in or...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 144  شماره 

صفحات  -

تاریخ انتشار 2004